On Improving Dissimilarity-Based Classifications Using a Statistical Similarity Measure

نویسندگان

  • Sang-Woon Kim
  • Robert P. W. Duin
چکیده

The aim of this paper is to present a dissimilarity measure strategy by which a new philosophy for pattern classification pertaining to dissimilaritybased classifications (DBCs) can be efficiently implemented. In DBCs, classifiers are not based on the feature measurements of individual patterns, but rather on a suitable dissimilarity measure among the patterns. In image classification tasks, such as face recognition, one of the most intractable problems is the distortion and lack of information caused by the differences in illumination and insufficient data. To overcome the above problem, in this paper, we study a new way of measuring the dissimilarity distance between two images of an object using a statistical similarity metric, which is measured based on intra-class statistics of data and does not suffer from the insufficient number of the data. Our experimental results, obtained with well-known benchmark databases, demonstrate that when the dimensionality of the dissimilarity representation has been appropriately chosen, DBCs can be improved in terms of classification accuracies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Imbalanced data classification accuracy by using Fuzzy Similarity Measure and subtractive clustering

 Classification is an one of the important parts of data mining and knowledge discovery. In most cases, the data that is utilized to used to training the clusters is not well distributed. This inappropriate distribution occurs when one class has a large number of samples but while the number of other class samples is naturally inherently low. In general, the methods of solving this kind of prob...

متن کامل

Improved K-Modes for Categorical Clustering Using Weighted Dissimilarity Measure

K-Modes is an extension of K-Means clustering algorithm, developed to cluster the categorical data, where the mean is replaced by the mode. The similarity measure proposed by Huang is the simple matching or mismatching measure. Weight of attribute values contribute much in clustering; thus in this paper we propose a new weighted dissimilarity measure for K-Modes, based on the ratio of frequency...

متن کامل

Complexity of European Union Languages: A comparative approach

In this article, we are studying the differences between the European Union languages using statistical and unsupervised methods. The analysis is conducted in the different levels of language: the lexical, morphological and syntactic. Our premise is that the difficulty of the translation could be perceived as differences or similarities in different levels of language. The results are compared ...

متن کامل

Efficient Clustering of High Dimensional Datasets with Multi Viewpoint Based Similarity Measure

Many important real time applications involve clustering large datasets. Dataset can be large if there are a large number of elements in the data set, each element can have many features and there can be many clusters to discover. Recent advances in clustering algorithms have been addressed these datasets issues partially. However, there has been much less work on methods of efficiently cluster...

متن کامل

Extending k-Representative Clustering Algorithm with an Information Theoretic-based Dissimilarity Measure for Categorical Objects

This paper aims at introducing a new dissimilarity measure for categorical objects into an extension of k-representative algorithm for clustering categorical data. Basically, the proposed dissimilarity measure is based on an information theoretic definition of similarity introduced by Lin [15] that considers the amount of information of two values in the domain set. In order to demonstrate the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010